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ABSTRACT 

Ten-eleven translocation (TET) family enzymes con- 
vert 5-methylcytosine to 5-hydroxylmethylcytosine. 
However, the molecular mechanism that regulates 
this biological process is not clear. Here, we show 
the evidence that PGC7 (also known as Dppa3 or 
Stella) interacts with TET2 and TET3 both in vitro 
and in vivo to suppress the enzymatic activity of 
TET2 and TET3. Moreover, lacking PGC7 induces the 
loss of DNA methylation at imprinting loci. Genome- 
wide analysis of PGC7 reveals a consensus DNA motif 
that is recognized by PGC7. The CpG islands 
surrounding the PGC7-binding motifs are 
hypermethylated. Taken together, our study demon- 
strates a molecular mechanism by which PGC7 
protects DNA methylation from TET family enzyme- 
dependent oxidation. 

INTRODUCTION 

DNA methylation at the 5 position of the cytosine pyr- 
imidine ring (5mC) in genomic DNA is an important epi- 
genetic mark that regulates gene transcription during 
numerous physiological and pathological processes 
(1-5). In particular, during early embryogenesis, DNA 
methylation is dynamically regulated to precisely control 
transcription for defining cell fate (6-8). A global DNA 
demethylation occurs in the paternal pronucleus of a 
zygote, which is likely to remove epigenetic barriers that 
restrict gene transcription and developmental potential 
(9-11). This active DNA demethylation process is 
mediated by TET family enzymes that oxidize methyl 
groups at the 5 position of cytosine pyrimidine rings 
into hydroxymethyl groups (5hmC) in the presence of 
2-oxoglutarate (2-OG) and Fe(II) (11-13). Further oxida- 
tion, probably catalyzed by these oxygenases, generates 
5-formylcytosine (5fC) and 5-carboxylcytosine (5caC) 
(14,15). Both 5fC and 5caC are relatively unstable on 
the chromatin (14,15) and are recognized by thymine- 
DNA glycosylase (TDG) that excises the base of 5fC or 



5caC from the chromatin for the completion of the active 
demethylation process(14,16). 

Among these TET enzymes, TET3, but not TET1 and 
TET2, is specifically expressed in zygotes (17-21). Thus, it 
is TET3 that mediates global DNA demethylation in the 
paternal pronucleus of zygote (17-20). Interestingly, 
TET3 is located in both maternal and paternal pronucleus 
although TET3 associates with the paternal pronucleus 
more tightly than with the maternal pronucleus (19). 
How can TET3 specifically remove DNA methylation 
mark in paternal pronucleus but not maternal pronucleus? 
Recent studies indicate that PGC7, a maternal factor, par- 
tially protects DNA methylation status in the maternal 
pronucleus (19,20). PGC7 is a 159-residue nuclear poly- 
peptide that is mainly expressed in germ cells and pluri- 
potent cells (22-24). Interestingly, the PGC7 gene only 
exists in mammals and evolved very quickly (24-26). 
PGC7 +I ~ mice could give birth to PGC7 null offspring 
without obvious developmental defects. However, 
oocytes from PGC7~ ~ mice induce early embryogenesis 
arrestment at the 4-cell stage following fertilization (26- 
28). Detailed analysis on the PGC7-deficient zygotes 
suggests that loss of PGC7 induces TET3-dependent 
DNA demethylation in the maternal pronucleus, suggest- 
ing that PGC7 is an important maternal factor for pro- 
tecting DNA methylation (19,20). It has been shown that 
PGC7 binds to dimethylated histone H3K9 (H3K9me2) in 
the heterochromatin region through its N-terminal tail 
(19). Interestingly, in this region, PGC7 contains a 
putative DNA-binding domain (26). Thus, the molecular 
mechanism by which PGC7 suppresses TET enzyme- 
induced DNA demethylation could be more complicated. 
Moreover, a large portion of DNA methylation remains 
intact even in the absence of PGC7 (28), suggesting that 
PGC7 only partially protects methylated DNA from 
TET3-dependent demethylation. 

To examine the molecular mechanism of PGC7 in pro- 
tecting DNA methylation, we analyzed the domains im- 
portant for the function of PGC7. Although the homology 
of PGC7 is relatively low among different mammalian 
species, it contains a conserved DNA-binding domain at 
the N- terminus (26). Here, we characterized human PGC7 
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and found that the N-terminal DNA-binding domain of 
PGC7 interacted with TET2 and TET3. This interaction 
suppresses the enzymatic activity of TET2 and TET3 both 
in vitro and in vivo. Moreover, the DNA-binding domain 
of PGC7 recognizes a consensus DNA sequence and 
protects the surrounding CpG methylation in the chroma- 
tin. These results provide a novel molecular mechanism by 
which PGC7 suppresses TET family enzymes-induced 
DNA demethylation. 



MATERIALS AND METHODS 

Plasmids and antibodies 

Full-length complementary DNA (cDNA) of TET1, 
TET2 and TET3 were cloned into pS-FLAG-SBP (SBP) 
vector. For protein co-immunoprecipitation experiments, 
PGC7 and PGC7 deletion mutants were cloned into SBP 
vector as well as pCMV-Myc vector. TET2 deletion 
mutants were cloned into SBP vector. TET3CD was 
subcloned into pCMV-HA vector and SBP vector. 

Anti-FLAG (M2) antibody was purchased from Sigma. 
Anti-HA and anti-Myc monoclonal antibody were 
purchased from Covance. Anti-5hmC polyclonal antibody 
was purchased from Zymo Research. Anti-5mC monoclo- 
nal antibody was purchased from Calbiochem. Rabbit anti- 
human PGC7 antibody was raised against full length of 
human PGC7. Rabbit anti-human TET 3 antibody was 
raised against the a. a. 351-668 of hTET3, rabbit anti- 
human TET1 antibody was raised against C-terminus 
(a.a. 1500-2008) of TET1. Rabbit anti-human TET2 
antibody was purchased from Abeam (ab94580). 

Recombinant proteins 

Recombinant proteins were purified from Sf9 insect cells. 
For generating baculovirus, DNA fragments containing 
full-length human PGC7, deletion mutants of PGC7, 
TET2 and TET3CD were subcloned into pFastBac 
vector with a GST or SBP tag. Baculoviruses were 
generated in accordance with the manufacturer's instruc- 
tions (Invitrogen). After Sf9 cells were infected with 
baculoviruses for 48 h, the cells were harvested, washed 
with cold phosphate buffered saline (PBS) three times 
and lysed with ice-cold NETN100 buffer (20 mM Tris- 
HC1 pH 8.0, lOOmM NaCl, 1 mM EDTA, 0.5 % 
Nonidet P-40). The soluble fraction was incubated with 
glutathione-sepharose beads (for GST-tag proteins) or 
streptavidin-conjugated beads (for SBP-tag proteins) and 
eluted with glutathione or biotin. For protein binding 
assay, baculoviruses expressing GST-tagged PGC7 and 
SBP-tagged TET3CD co-infected Sf9 cells. 

Immunofluorescence staining 

Cells grown on coverslips were fixed with 3% 
paraformaldehyde for 20min and permeabilized with 
0.5% Triton X-100 in PBS for 5min at room temperature. 
Cells were then washed with cold PBS and incubated with 
blocking buffer (10% donkey serum, 3% bovine serum 
albumin in PBS containing 0.1% Triton X-100) before 
incubation with primary antibodies overnight in a 



humidified chamber at 4°C. For 5hmC staining, 
permeabilized cells were denatured with 2N HC1 for 
30min, neutralized with 100 mM Tris-HCl (pH 8.5) 
before blocking. After three consecutive 5-min washes 
with PBS, cells were incubated with secondary antibodies 
for 30min. The cells were washed again three times with 
PBS and then mounted onto glass slides and visualized by 
a fluorescence microscope. 

Cell lysis, immunoprecipitation and western blotting 

For immunoprecipitation assays, 293 T cells were lysed 
with ice-cold NETN400 buffer (0.5% NP-40, 50 mM 
Tris-HCl pH 8.0, 2mM EDTA and 400 mM NaCl) con- 
taining lOmM NaF and 50 mM (3-glycerophosphate. The 
soluble fractions were collected and diluted to NETN100, 
then directly subjected to electrophoresis or immunopre- 
cipitation with indicated antibodies followed by western 
blot analysis with indicated antibodies. 

Chromatin immunoprecipitation assay and methylated 
DNA immunoprecipitation assay 

Chromatin immunoprecipitation assays (ChIP) were per- 
formed according to the protocol described by Upstate. 
The 293 T cell DNA was sonicated to an average size 
between 300 and 600 bp. Solubilized chromatin was 
immunoprecipitated with the antibody against TET3 and 
PGC7. Antibody-chromatin complexes were pulled down 
using protein A-sepharose beads, washed and then eluted. 
After cross-link reversal and proteinase K treatment, 
immunoprecipitated DNA was extracted with phenol- 
chloroform, ethanol-precipitated and treated with 
RNase. ChIP DNA was qualified using PicoGreen. 

For methylated DNA immunoprecipitation (MeDIP), 
genomic DNA was extracted using Promega Wizard 
Genomic DNA Purification Kit. After Proteinase K and 
RNase treatment, the genomic DNA was extracted with 
phenol-chloroform, precipitated with ethanol and 
dissolved with Tris-EDTA (TE) buffer. The purified 
genomic DNA was sonicated using Diagenode Bioruptor 
200. The fragment size of DNA was determined by 2% 
agarose gel. The DNA fragment size was 200-800 bp. The 
samples were heat-denatured and mixed with 1 jig 5mC 
antibody. The mixtures were incubated overnight at 4°C 
and then pulled down by protein A-sepharose beads. The 
beads were extensively washed and digested by Proteinase 
K. DNA was further purified by phenol-chloroform 
extraction and ethanol precipitation. MeDIP DNA was 
qualified using PicoGreen. 

ChIP sequencing 

DNA fragments isolated from ChIP were repaired to blunt 
ends by T4 DNA polymerase and phosphorylated with T4 
polynucleotide kinase using the END-IT kit (Epicentre). 
A single 'A' base was added to 3'-end with Klenow. 
Double-stranded adaptors (75 bp with a T' overhang) 
were ligated to the fragments with DNA ligase. Ligation 
products between 200 and 600 bp and were gel purified to 
remove unligated adaptors and subjected to 20 polymerase 
chain reaction (PCR) cycles. Completed libraries were 
quantified with PicoGreen. The DNA libraries were 
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analyzed by Illumina Hiseq2000 high-throughput 
sequencing. The read quality of each sample was 
determined by FastQC software. After pre-filtering the 
raw data by removing sequence adaptors and low quality 
reads, the tags were mapped to the human genome (hgl9) 
by Bowtie software. Parameters settings were listed as 
follows: —v, 3 (reported alignments with at most three 
mismatches), —5, 3 and —3, 7 (trim three bases from 5'- 
end and seven from 3'-end to remove low-quality bases). 
Peak detection was performed using MACS software from 
Galaxy browser (http://www.galaxy.psu.edu). Parameter 
settings were as follows: IgG ChlP-Seq aligned reads were 
used as control file, tag size with 25 bp and band width 
300 bp. The peaks obtained from ChIP sequencing (ChlP- 
Seq) were matched to the annotated reference genome 
(human hgl9) using Cisgenome 2.0. To view the peak 
density and position, Cisgenome 2.0 was used. The 
relative accumulations of tags around transcription start 
sites (TSS) were performed using SEQMINER software. 
The TSS position of each gene was downloaded from 
UCSC genome browser. Integrative Genomics Viewer 
(IGV) was used to analyze the peak distribution in the 
chromosome level. To obtain the binding motif of PGC7, 
the online software Peak-motifs (http://rsat.ulb.ac.be/peak- 
motifs_form.cgi) was used. A set of 30 PCR primer pairs 
(Supplementary Table SI) were designed to amplify 100— 
140 bp fragments from genomic regions showing a wide 
range of signals for TET3 and PGC7 ChlP-Seq as well as 
MeDIP sequenceing (MeDIP-Seq). ChIP quantitative PCR 
(ChlP-qPCR) values reflect two independent ChIP assays, 
and each was evaluated in duplicate by qPCR. For methy- 
lation analysis of the aligned reads of MeDIP-Seq, the 
Batman algorithm has been used (29). Batman algo- 
rithm infers the absolute methylation state for 100-bp 
windows by estimating local sequencing read enrichment 
for methylation taking into account the varying densities 
of methylated CpGs across the genome. Batman output is 
in the form of GFF format, each GFF file represents a 
score that equals the median of methylation states in a 
100-bp window. The score ranges between 0 and 1 accord- 
ing to the level of methylation (percentage of methylated 
CpGs in the specific region). Methylation states of Group I 
or Group II genes were summarized, respectively, 
based on the Batman score at TSS (± 1 kb) (Figure 5B). 
ChlP-Seq data of O-linked N-acetylglucosamine 
(GlcNac) transferase (OGT) was according to our 
previous study (30). 

In vitro 5hmC assay 

To examine the effect of PGC7 on the activity of TET2 
and TET3 in vitro, 0.4 jig double-stranded DNA sub- 
strates with 5-methylcytosine replacing cytosine were 
mixed with 5 ug wild-type recombinant PGC7 or PGC7 
mutant proteins as well as TET3CD or TET2CD proteins 
and incubated with 50 mM HEPES, pH 8, 50 mM NaCl, 
2mM Ascorbic Acid, 1 mM 2-OG, 100 uM FAS (Fe 2+ ), 
1 mM DTT and 1 mM ATP for 1 h at 37°C. The samples 
were subjected to dot blotting assay or PvuRtslI restric- 
tion enzyme digestion. 



Dot blotting assay 

Genomic DNA from 293T cells or the DNA substrate 
from in vitro assays was denatured by 0.2 N NaOH 
and dotted on Hybond-N+ nitrocellulose membrane 
(Amersham Pharmacia Biotech). After drying at 65°C, 
the membrane was blocked with 10% non-fat milk for 
1 h at room temperature followed by 2-h incubation with 
anti-5hmC antibodies at room temperature. After three 
consecutive 10-min washes with Tris-buffered saline with 
Tween®-20 (TBST), the membrane was incubated with 
horseradish peroxidase (HRP)-conjugated goat-anti- 
rabbit secondary antibody for 1 h. The membrane was 
washed again three times with TBST and developed 
using the Enhanced ChemiLuminescence plus (ECL+) de- 
tection system (GE Healthcare). 

PvuRtslI restriction enzyme digestion assay 

CpGenome 5mC DNA standard (Millipore, Cat. # S8005) 
was subjected to in vitro 5hmC assay. The DNA from 
in vitro assay was digested by PvuRtslI restriction 
enzyme (Active Motif) at 22°C. After 30min, the DNA 
samples were subjected to gel electrophoresis or qPCR. 
PCR primers are listed in Supplementary Table S 1 . 

Bisulfite sequencing 

The methylation profiles of Pegl, Peg3, Peg 10 and H19 
were determined by a slightly modified genomic 
sequencing technique after bisulfite treatment. Briefly, 
500 ng genomic DNA isolated from HBL100, or 
HBL100 with PGC7 knockdown or TET3 overexpression 
or both was treated with bisulfite according to the 
protocol of the Zymo EZ DNA Methylation Kit (Kit # 
D5001). The bisulfite-converted DNA was subjected 
to PCR or nesting PCR. The primers are listed in 
Supplementary Table SI. The PCR products were 
separated by 1% agarose gel, purified by gel extraction 
kit and then cloned to a T vector for sequencing. 

Retroviral vectors construction, virus production 
and infection 

Retrovirus short hairpin RNA (shRNA) vectors targeting 
human PGC7 (shPGC7) and Scramble were constructed 
by inserting short hairpin RNA templates into pMSCV- 
neo-U6, a RNAi vector that was constructed in Dr Zhao's 
lab (31). The short hairpin RNA templates specifically 
targeting PGC7 were designed, synthesized and annealed 
as previously reported. The sequences are shown in 
Supplementary Table SI. The shPGC7 and Scramble 
vectors were transfected into packaging cell line 293T 
with two other helper packaging plasmids pMD-MLV- 
OGP (gag-pol) and pVSV-G (env). After 48 h of transfec- 
tion, the cell culture medium was harvested and the viral 
particles were concentrated by ultracentrifugation at 
50 000g for 3h and then resuspended in expansion 
medium. HBL100 cells were infected with retrovirus at a 
multiplicity of infection (MOI) of 10 for 4h in the 
presence of 8 ug/ml polybrene. After infection, HBL100 
cells were allowed to recover for 24 h and then selected 
in expansion medium with 800 ug/ml G418 (Sigma 
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Aldrich) for 1 week and maintained in the medium with 
400 us/ml of G418. 



RESULTS 

Recent study suggests that PGC7 plays an important role 
in protecting DNA methylation during early embryogen- 
esis, especially at several imprinting loci (28). To examine 
the molecular mechanism of this interesting phenomenon, 
we hypothesized that PGC7 might interact with DNA 
methylation and demethylation enzymes, namely DNMT 
family and TET family enzymes. It has been reported that 
DNMT1, 3 a, 3 b and TET3 all regulate DNA methylation 
pattern during early embryogenesis (17,32-35). Thus, we 
examined the interaction between PGC7 and these 
enzymes using co-immunoprecipitation (co-IP) assays 
(Supplementary Figure SI A). Interestingly, we found 
that PGC7 specifically associated with TET3 but not the 
DNA methyl transferase (DNMT) family enzymes. In 
addition to TET3, both TET1 and TET2 can convert 
5mC into 5hmC for potential DNA demethylation. 
Thus, we also examined the interaction between PGC7 
and these two TET enzymes and found that only TET2 
interacts with PGC7 (Figure 1A). Collectively, these 
results suggest that PGC7 selectively interacts with 
TET2 and TET3. 

To examine the details of these interactions, we mapped 
the regions that mediate these interactions. Based on the 
predicted protein folding, we generated a series of trunca- 
tion mutants and found that only the C-terminal catalytic 
DSBH (double strand (3-helix) domain (CD domain) of 
TET2 interacts with PGC7 (Figure IB). Consistently, 
the CD domain of TET3 also interacted with PGC7 
(Figure 1C), suggesting that the CD domains in TET2 
and TET3 are important for the interaction with PGC7. 
To map the binding region in PGC7, a series of 
N-terminal and C-terminal truncation mutants were 
generated. Deletion of the N-terminal a. a. 1-19 (Dl) or 
C-terminal a.a. 96-159 (D7) did not affect the interaction. 
However, deletion of the N-terminal a.a. 1-25 (D2) and 
C-terminal a.a. 91-159 (D8) significantly decreased or 
even abolished the interaction with TET3. Moreover, the 
region encoding a.a. 20-95 was sufficient to interact with 
TET2 or TET3 (Figure ID). Interestingly, this region of 
PGC7 is a putative DNA-binding domain. Thus, it is 
likely that this domain of PGC7 interacts with TET2 
and TET3. In addition, to examine whether PGC7 
directly interacts with TET enzymes, we co-infected Sf9 
cells with baculoviruses encoding GST-PGC7 and SBP- 
TET3 CD domain. These two recombinant proteins 
could be co-purified (Supplementary Figure SIB). Taken 
together, these results suggest that PGC7 interacts with 
TET3 in vitro and in vivo. 

To understand the functional significance of the inter- 
action between PGC7 and TET2 or TET3, we examined 
whether PGC7 regulated TET2 or TET3-dependent 5mC 
oxidation. Accumulated evidence shows that TET 
enzymes oxidize 5mC to 5hmC in the presence of Fe(II) 
and 2-OG (12,13). Thus, we established the in vitro DNA 
oxidation assays to detect 5hmC. Using dot blotting 



assays, we observed that the recombinant CD domains 
of TET family enzymes converted 5mC to 5hmC. 
Surprisingly, when the recombinant PGC7 was supple- 
mented into this in vitro DNA oxidation assay, the recom- 
binant PGC7 repressed the enzymatic activity of the CD 
domains of TET2 and TET3 (Figure 2A and 
Supplementary Figure S2A). Next, we generated the 
recombinant Dl, D3, D7-9 mutants of PGC7 
(Supplementary Figure S2B). As shown in Figure 2B, 
the D3 and D8 mutants of PGC7 that abolish the inter- 
action with TET3 failed to suppress the enzymatic activity 
of the CD domain of TET3, whereas the Dl, D7 and D9 
mutants of PGC7 that still interact with TET3 could 
suppress the TET3-mediated DNA oxidation. These 
results suggest that the interaction between PGC7 and 
TET enzymes is critical for this suppression. Moreover, 
PGC7 failed to suppress TET 1 -dependent oxidation of 
methylated DNA as PGC7 does not interact with TET1 
(Figure 2A). The interaction region in PGC7 is a putative 
DNA-binding domain. To exclude the possibility that 
the inhibition of TET3 is due to the non-specific DNA- 
binding of PGC7, which blocks the 5mC epitope 
recognized by TET3, we included the recombinant 
PALB2 as the negative control. It has been shown that 
PALB2 non- specifically binds both single-stranded DNA 
and double-stranded DNA (36). However, supplement of 
PALB2 in this in vitro DNA oxidation assay could not 
block the TET3-dependent oxidation of methylated 
DNA (Figure 2C). As the putative DNA-binding 
domain of PGC7 directly interacts with the catalytic 
domain of TET3, it is likely that the interaction masks 
the catalytic site of TET3 and thus suppresses TET3- 
dependent oxidation of methylated DNA. We also used 
a different approach to confirm the results. PvuRtslI is 
a restriction enzyme that specifically digests double- 
stranded DNA containing 5hmC but not 5mC (37). 
When 5mC was converted to 5hmC by the CD domain 
of TET2 or TET3, the DNA substrate was digested by 
PvuRtslI. However, in the presence of PGC7, the enzym- 
atic activity of TET2 or TET3 was suppressed, which 
failed to convert 5mC into 5hmC. Thus, PvuRtslI could 
not digest the DNA substrate in the presence of PGC7 
(Figure 2D). We also examined the mutants of PGC7 in 
this assay. Again, the D9 mutant of PGC7 that still inter- 
acts with TET3 and suppresses the enzymatic activity of 
TET3 inhibited the DNA substrate digestion by PvuRtslI. 
However, with the D3 mutant of PGC7 that does not 
interact with TET3, PvuRtslI digested the DNA substrate 
oxidized by TET3 (Figure 2D). Moreover, we quantita- 
tively measured the remaining intact DNA substrate 
following PvuRtslI treatment by qPCR. The results are 
consistent with the agarose gel electrophoresis 
(Figure 2E). Taken together, these results suggest that 
the direct interaction between PGC7 and TET2 or TET3 
suppresses the enzymatic activity of TET2 or TET3. 

Next, we examined whether PGC7 could suppress 
TET2- and TET3-dependent DNA oxidation in vivo. As 
293T cells express TET1 (Figure 1A), depleting endogen- 
ous TET2 and TET3 did not significantly affect the level 
of 5hmC. Thus, we generated a 293T cell line stably 
expressing PGC7 and named it as 293T-PGC7 
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Figure 1. PGC7 interacts with TET2 and TET3. (A) PGC7 interacts with TET2 and TET3, but not with TET1. Co-IP was performed in 293T cells. 
PGC7 and TET1-3 were examined with indicated antibodies. The whole cell lysates (WCL) of 293T was used as the input. An irrelevant IgG was 
used as the IP control. (B) The catalytic DSBH (CD) domain of TET2 interacts with PGC7. Flag-tagged deletion mutants of TET2 were expressed in 
293T cells. Co-IP assays were performed with indicated antibodies. Mock transfected cells were used as the control. (C) The CD domain of TET3 
also interacts with PGC7. (D) The fragment of PGC7 with a.a. 20-95 interacts with TET3 catalytic domain (TET3CD) and TET2 catalytic domain 
(TET2CD). Myc-tagged N-terminus or C-terminus deletion mutants of PGC7 were generated and expressed in 293T cells. Co-IP assays were 
performed with indicated antibodies. 



(Supplementary Figure S3 A). We then expressed the cata- 
lytic domain of TET2 or TET3 in 293T and 293T-PGC7- 
cells. Compared with that in 293T cells, the level of 5hmC 
was remarkably reduced in 293T-PGC7 cells when the 
catalytic domain of TET2 or TET3 was expressed 



(Figure 3A). Moreover, 5hmC was also examined by im- 
munofluorescence staining. The 5hmC positively-stained 
population in 293T-PGC7 cells was significantly less 
than that in 293T cells when the catalytic domains of 
TET2 or TET3 were expressed (Figure 3B and C; 
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Supplementary Figure S3B). We also established 293T 
cell lines stably expressing D3, D8 or D9 mutants of 
PGC7 and named them as 293T-D3, 293T-D8 and 
293T-D9, respectively. Again, the D3 and D8 mutants of 
PGC7 that do not interact with TET2 or TET3 failed to 
suppress the TET2 CD domain or TET3 CD domain- 
induced 5hmC, whereas the D9 mutant that still 
interacts with TET2 or TET3 inhibits the TET2- or 
TET3-dependent 5hmC generation in vivo (Figure 3B 
and C). Collectively, these results demonstrate that 
PGC7 suppresses the enzymatic activity of TET2 and 
TET3 in vivo. 



Previous studies reported that PGC7 is important for 
the protection of DNA methylation mark in the maternal 
pronucleus from TET3-mediated global DNA 
demethylation (19). However, not all the methylated 
CpG loci are protected by PGC7 (28). Lacking PGC7 in 
oocytes induces loss of DNA methylation mark at certain 
imprinting loci (28), suggesting that PGC7 might select- 
ively protect methylated CpG loci from TET3-dependent 
DNA demethylation. To examine whether the interaction 
between PGC7 and TET3 protects the DNA methylation 
status of the imprinting loci, we analyzed epithelial 
HBL100 cell line, a diploid cell line established in vitro 
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from the milk of an apparently healthy woman (38). 
Unlike in 293T cells, few genetic alterations occur in 
HBL100 cells. We studied DNA methylation in HBL100 
cells. Using shRNA, PGC7 was stably down-regulated in 
HBL100 cells (Figure 4A and B). Moreover, we also stably 
expressed TET3 in these cells as the expression of both 
endogenous TET3 and TET2 in HBL100 is very low 
(Figure 4B and Supplementary Figure S4A). We 
examined and compared the DNA methylation status in 
the parental HBL100 cells and cells with down-regulation 
of PGC7 and/or up-regulation of TET3. Consistent with 
PGC7 knockout zygotes, we found that DNA methylation 
was significantly lost in Pegl, Peg3 and Peg 10 loci in 
HBL100 cells with both down-regulation of PGC7 and 
up-regulation of TET3 (Figure 4C). Of note, although 
up-regulation of TET3 significantly reduces the overall 
5mC level (Supplementary Figure S4B), it did not drastic- 
ally change the methylation pattern at the imprinting loci 
without down-regulation of PGC7 (Figure 4C), suggesting 
that the limited amount of endogenous PGC7 may select- 
ively protect these imprinting loci, but not global 5mC in 
HBL-100 cells. As Pegl, Peg3 and Peg 10 loci are maternal 
imprinting loci, we also examined H19 locus, a paternal 
imprinting locus. Again, the DNA methylation at H19 
locus was also lost when cells lacked PGC7 and expressed 



TET3 (Figure 4C). Moreover, using reverse transcriptase- 
PCR (RT-PCR), we found that the transcription of these 
genes significantly increased in the HBL100 cells with 
down-regulation of PGC7 and up-regulation of TET3 
(Supplementary Figure S4C). Thus, these results suggest 
that PGC7 protects DNA methylation from the 
TET3-dependent DNA demethylation process. 
Moreover, we introduced shRNA-resistant PGC7 or the 
D3 mutant back into the cells lacking PGC7 and express- 
ing TET3. Only wild-type PGC7, and not the D3 mutant, 
could suppress the loss of DNA methylation at the Pegl 
locus (Figure 4D). Taken together, these results demon- 
strate that the direct interaction between PGC7 and TET3 
protects the imprinting loci from TET3-dependent DNA 
demethylation. Thus, this human cell model recapitulates 
the DNA methylation protection by PGC7 during early 
embryo development. Moreover, it demonstrates a novel 
molecular mechanism by which PGC7 protects methylated 
CpG from TET enzyme-dependent DNA demethylation. 

As the region of PGC7 mediating the interaction with 
TET3 is a putative DNA-binding domain (26), we asked if 
this putative DNA-binding domain recognizes any specific 
DNA sequence in those loci analyzed above, thus 
determining the region under PGC7 protection against 
TET3 -mediated DNA demethylation. We performed 
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ChlP-qPCR assay and found that PGC7 indeed bound 
these loci. Moreover, like wild-type PGC7, the mutant 
D9, containing only the DNA-binding domain also 
associated with these imprinting loci (Supplementary 
Figure S5). To determine if PGC7 recognizes specific 



DNA sequence, we performed high-throughput ChlP- 
Seq to examine the genome-wide localization of PGC7 
and TET3 in 293T cells. A total of 9480 PGC7 target 
genes and 9392 TET3 target genes were identified from 
the ChlP-Seq analysis. The ChlP-Seq results were 
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validated by using ChlP-qPCR to examine 30 randomly 
picked loci that represent a broad range of ChlP-Seq 
fragment counts (Supplementary Figure S6A). We found 
63% PGC7 target genes were bound by TET3, whereas 
64% target genes of TET3 were bound by PGC7, 
indicating a significant overlap of the PGC7 and TET3 
target genes (Figure 5A). To examine the role of PGC7 
in the protection of DNA methylation, we named those 



genes bound by both PGC7 and TET3 as group I, and the 
genes only bound by TET3 as group II. We examined the 
DNA methylation using MeDIP. Comparing group I with 
group II genes, we found that the DNA methylation level 
in group I genes was significantly higher than that in 
group II genes (Figure 5B). TET enzymes have been 
shown to locate at the TSS (8,30,39,40). Thus, we 
examined the TSS occupied by TET3. The 5mC levels 
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are significantly higher at TSS in the presence of PGC7 
than those without PGC7 (Figure 5C). Moreover, 
chromosome level analysis further indicates that PGC7 
and TET3 co-occupied regions are associated with high 
levels of 5mC, such as chromosome 13 as shown 
(Figure 5D). We also examined the localization of PGC7 
and TET3 at the Pegl, Peg3, Peg 10 and H19 imprinting 
loci with the high-throughput ChlP-Seq analysis. Again, 
PGC7 co-localized with TET3 in these gene loci 
(Figure 5E). To further characterize the DNA-binding of 
PGC7, we analyzed the DNA-binding sequence that is 
recognized by PGC7 using Peak-motifs software (http:// 
rsat.ulb.ac.be/peak-motifs_form.cgi). Based on the PGC7- 
enriched regions, a specific DNA-binding sequence was 
concluded by the software and this sequence was con- 
firmed in the Pegl, Peg3, Peg 10 and HI 9 imprinting 
loci, which are occupied by PGC7 (Figure 5F). 
Electromobility shift assays further confirmed the inter- 
action between PGC7 and DNA with the consensus 
motif (Supplementary Figure S6B). 

Besides the imprinting genes, PGC7 and TET3 co- 
localize at many other gene loci containing the consensus 
DNA sequence recognized by PGC7 (Supplementary 
Table S2). The ontology analysis classified these genes 
into different pathways (Supplementary Figure S6C). 
Interestingly, a number of these genes are tissue-specific 
genes, such as male germline expressing genes and somatic 
genes specifically expressed in hematopoietic linages or 
retina (Supplementary Table S3). Like imprinting loci, it 
has been reported that DNA methylation status in the 
promoter region of these genes could be inherited from 
parental gametes (41). We randomly picked up five 
genes, including male germline expressing gene Piwill, 
Spaca4, Tssk2, hematopoietic gene Fyb and retina- 
related gene Rrh and examined the localization of 
PGC7, TET3 and 5mC. Like those imprinting loci, 
PGC7 co-localizes with TET3 and is associated with the 
high level of 5mC at these loci (Figure 6A). ChlP-qPCR 
was performed for these genes to confirm the ChlP-Seq 
results (Figure 6B). Thus, based on our study, we propose 
that PGC7 protects DNA methylation from TET enzyme- 
dependent oxidation by interaction with TET2 and TET3 
catalytic domain at a set of gene loci (Figure 6C). 



DISCUSSION 

In this study, we show the evidence that PGC7 suppresses 
the enzymatic activity of TET2 and TET3 and protects 
5mC from oxidation. The interaction between PGC7 
and TET2 or TET3 is mediated by the DNA-binding 
domain of PGC7 and the catalytic domain of TET2 or 
TET3. Thus, it is likely that the DNA-binding domain 
of PGC7 directly blocks the activation sites of TET2 
and TET3, thus suppressing the enzymatic activity of 
TET2 and TET3. However, the DNA-binding domain of 
PGC7 does not interact with TET1. Interestingly, the 
homology between the catalytic domain of TET2 and 
TET3 is higher than that between these two TET 
enzymes and TET1. The major difference is in a variable 
region inside of the catalytic domain (13,42). However, 



only the variable regions from TET2 and TET3 do not 
interact with PGC7 (Supplementary Figure S7). It is 
possible that the variable region alone could not be cor- 
rectly folded. Future structural analysis will reveal the 
details of the interaction between PGC7 and TET2/3. 
Nevertheless, our study suggests that the different cata- 
lytic domains of TET enzymes determine the specific inter- 
actions with their binding partners. Of note, the 
interaction between PGC7 and TET2 or TET3 is similar 
to that between OGT and TET2 or TET3 (30,43,44). Like 
PGC7, OGT directly interacts with the catalytic domain of 
TET2 or TET3 (30,43,44). As both PGC7 and OGT 
interact with the catalytic domain of TET2 and TET3, 
we examined whether PGC7 and OGT are in one 
complex. However, we could not detect any interaction 
between OGT and PGC7 using IP and western blotting 
(Supplementary Figure S8). We further compared 
genome-wide distribution of OGT and PGC7, and did 
not observe the significant overlapping between OGT 
and PGC7-occupied genes, suggesting that the OGT/ 
TET2 and PGC7/TET2 complexes are functionally inde- 
pendent. It is possible that OGT and PGC7 compete with 
each other to interact with TET2/TET3. Moreover, OGT 
has been shown to functionally interact with TET2 for the 
transcription activation (30,43,44). Here, we found that 
PGC7 negatively regulates TET2 or TET3 by suppressing 
the enzymatic activity of TET2 or TET3. Further analysis 
of TET2 and TET3 complexes will reveal the functional 
correlation between PGC7 and OGT. 

Our analysis suggests that the DNA-binding domain of 
PGC7 not only blocks the enzymatic sites of TET2 and 
TET3 but also recognizes a consensus DNA sequence. 
Thus, the DNA-binding domain of PGC7 has dual func- 
tions. In somatic cells or other development stages, low 
levels of PGC7 bind consensus DNA sequence and protect 
surrounding CpG loci from TET enzyme-dependent DNA 
demethylation. It is possible that the relatively high levels 
of PGC7 in germ cells may not only protect DNA methy- 
lation at all the PGC7-binding loci, but additionally 
block TET3-mediated global DNA demethylation non- 
specifically after saturating the PGC7-binding sites on 
the chromatin. This is supported by the interesting 
phenomenon that the chromatin-associated TET3 is 
easily eluted from the chromatin at the maternal pro- 
nucleus following Triton X-100 treatment in the 
presence of high levels of PGC7 in zygotes (19). In 
addition, recent studies indicate that PGC7 recognizes 
H3K9me2 through its N-terminal tail (19). We examined 
the role of H3K9me2 in the recruitment of PGC7 to 
chromatin by reducing H3K9me2 with two different 
approaches (Supplementary Figure S9). Lacking 
H3K9me2 only slightly reduces the chromatin-bound 
PGC7, indicating that both DNA and H3K9me2 
function together to recruit PGC7 and the N-terminal 
tail of PGC7 is important for both DNA-binding and 
histone binding as well as protecting DNA methylation 
(Figure 6C). 

In this study, we have shown that PGC7 suppresses the 
enzymatic activity of both TET2 and TET3. Our genome- 
wide ChlP-Seq analyses indicate that PGC7 and TET3 
target genes are largely overlapped. However, a significant 
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portion of PGC7 targets are not occupied by TET3. As 
PGC7 also interacts with TET2, it is likely that PGC7 
adopts a similar mode to protect CpG methylation from 
TET2-dependent DNA demethylation in these loci. 
Moreover, PGC7 could not protect all the CpG methyla- 
tion from TET enzyme-mediated global DNA methyla- 
tion. In particular, DNA methylation in the repetitive 
sequence is only mildly reduced and a number of imprint- 
ing regions are not affected even lacking PGC7 in zygotes 



(28). Thus, other mechanisms may exist to protect the 
maternal pronucleus from TET enzyme-mediated global 
DNA demethylation. 

SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online, 
including [45]. 
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